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The vast majority of genes in humans and other organisms undergo alternative splicing, yet the biological function of splice 
variants is still very poorly understood in large part because of the lack of simple tools that can map the expression profiles 
and patterns of these variants with high sensitivity. High-throughput quantitative real-time polymerase chain reaction 
(qPCR) is an ideal technique to accurately quantify nucleic acid sequences including splice variants. However, currently 
available primer design programs do not distinguish between splice variants and also differ substantially in overall quality, 
functionality or throughput mode. Here, we present GETPrime, a primer database supported by a novel platform that 
uniquely combines and automates several features critical for optimal qPCR primer design. These include the consideration 
of all gene splice variants to enable either gene-specific (covering the majority of splice variants) or transcript-specific 
(covering one splice variant) expression profiling, primer specificity validation, automated best primer pair selection ac- 
cording to strict criteria and graphical visualization of the latter primer pairs within their genomic context. GETPrime 
primers have been extensively validated experimentally, demonstrating high transcript specificity in complex samples. 
Thus, the free-access, user-friendly GETPrime database allows fast primer retrieval and visualization for genes or groups 
of genes of most common model organisms, and is available at http://updepla1srv1.epfl.ch/getprime/. 

Database URL: http://deplanckelab.epfl.ch. 



Background 

Large-scale genomic approaches have demonstrated exten- 
sive alternative splicing in humans and other model organ- 
isms (1, 2), and current gene models are continuously 
updated to include additional splicing events (3). The regu- 
latory mechanisms underlying alternative splicing, as well 
as the biological significance and function of individual 
gene splice variants are, however, still very poorly under- 
stood (4). This is in large part due to the fact that simple 
tools allowing the analysis and quantification of individual 
splice variants with high sensitivity are lacking. High- 
throughput quantitative real-time polymerase chain reac- 
tion (qPCR) is an ideal technique to accurately quantify nu- 
cleic acid sequences including splice variants. In addition, it 



complements gene expression analyses done by microarray 
or deep sequencing because the latter analysis methods are 
still less efficient in terms of overall cost and computational 
expertise required than qPCR for the quantitative detection 
of gene transcripts, especially those that are lowly ex- 
pressed such as many transcription factor (TF) genes (5, 6). 
The choice of suitable primer sets is thereby critical to 
obtain optimal qPCR results. An ideal qPCR primer design 
program should at least include the following features: 
first, given the above mentioned increasing interest in 
understanding the role of individual gene splice variants 
(1, 7, 8), the program needs to take into account all anno- 
tated splice variants of each gene to enable either gene- 
(covering the majority of splice variants) or transcript- 
specific (covering one splice variant) expression profiling; 
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second, at least one primer needs to span exons to avoid 
amplification of contaminating genomic DNA; third, the 
specificity of each primer needs to be automatically evalu- 
ated by similarity search; fourth, no cumbersome post- 
processing should be required to retrieve the best primer 
combination and fifth, the location of primers pairs within 
their genomic context should be visualized for easy, final 
evaluation by the end user. 

In search of qPCR primer design software for a large- 
scale TF gene expression profiling experiment, we con- 
sidered several software packages but none of them 
fulfilled the stipulated requirements as these programs 
all varied in quality, functionality or throughput mode 
(Table 1). The most popular interface, based on Primer3 
(9), named Primer3Plus (10), allows the user to define a 
variety of possible parameters and options for designing 
oligonucleotide primers. However, the use of this program 
is time-consuming because users have to manually process 
the large number of proposed primers when, for example, 
verifying primer specificity by BLAST (11). Another pro- 
gram, RASE (12) generates qPCR primers for the detection 
and quantitation of specific splicing isoforms, but does not 
enable the design of gene-specific primers. In addition, its 
associated web interface only supports low-throughput 
experiments. Other programs such as PerlPrimer (13), 
QuantPrime (14), and BatchPrimer3 (15) do allow batch 
primer input, and some databases of qPCR primers includ- 
ing Quantitative PCR Primer Database (16), RTPrimerDB 
(17), PrimerBank (18) and qPrimerDepot (19) were de- 
veloped for high-throughput primer design or retrieval. 



But again, none of these packages combines and auto- 
mates all of the important features required to address the 
increasing demands in qPCR primer design for high- 
throughput qPCR experiments, especially the requirement 
to target genes in gene- or transcript-specific fashion with- 
out post-processing (Table 1). 

To fill this current void, we developed our own qPCR 
primer design software, GETPrime. This program was de- 
signed to generate primers targeting every gene available 
in the latest Ensembl release, which is used as a reference 
resource (20). However, to allow fast primer retrieval, we 
have linked our program to GETPrimedb, a database en- 
abling fast retrieval via a user-friendly interface of gene- 
or transcript-specific primers for all Homo sapiens, Mus 
musculus, Caenorhabditis elegans, Drosophila melanoga- 
ster and Danio rerio genes in assembled chromosomes 
annotated in the Ensembl database. 

Database construction and 
development 

Primer generation 

GETPrime combines several existing tools: the PerlPrimer 
program (13), Blast (11) and the Ensembl database through 
a custom perl wrapper, which enables automation of the 
workflow and decision process for selecting the best primer 
pairs (see workflow, Figure 1). 

The first step in the GETPrime pipeline is the selection of 
a transcript, as an input, from the Ensembl database for 
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each target gene of interest. For comprehensiveness, we 
included all genes featuring a 'known' or 'novel' status in 
the Biomart portal in the GETPrime database (21). Each 
gene-specific transcript is then selected based on two inde- 
pendent criteria: (i) its status must be annotated as 
'KNOWN' because we opted not to include 'novel' tran- 
scripts in our database as these are not yet validated by 
species-specific sequencing data (Bert Overduin from 
Ensembl Project, Personal communication) and (ii) the high- 
est junction score (Figure 2). In the first round, this score 
allows the selection of a transcript that contains conserved 
exon junctions within the alternative splice variants. If the 
same highest score is obtained by several transcripts, then 



the transcript with the shortest sequence is selected. Next, 
this first selected transcript is provided as an input to our 
modified PerlPrimer program. This program supports 
primer design for one specific transcript at exon junctions 
to avoid unspecific amplification due to DNA contamin- 
ation. Using a graphical user interface, PerlPrimer runs 
Spidey (22) to detect intron/exon boundaries, and searches 
all possible primer pairs on the input transcript. To enable 
the generation of primers for a large number of genes, we 
modified PerlPrimer to use the exon junction coordinates 
supplied by Ensembl. In addition, in contrast to PerlPrimer, 
that frequently generates tens of candidate primer pairs 
without quality scores, GETPrime runs an extensive 
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Figure 1. Overall primer design pipeline. The overall workflow is depicted. The green box is explained in Figure 2. The pink and 
purple boxes are explained in more detail in Figures 3 and 4, respectively. For more details, please see main text. 
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Figure 2. Visualization of the junction score algorithm concept for gene- or transcript-specific primer design. A schematic 
representation of a gene with three alternative transcripts is depicted. The blue boxes represent exons, the lines represent 
introns. Each transcript contains a subset of exons A-D and A'. The junction score (A/) constitutes the number of transcripts 
containing the respective splice junction. To design gene-specific primers, the sum of the junction scores, represented by S, is 
calculated for each transcript and the transcript with the highest S-value that contains the junction with the highest A/-score is 
selected (here t-i). Then, the gene-specific primers are preferentially designed so that one of the primers spans the exon junction 
with the highest A/-score within the selected transcript (t|), as indicated by the dark blue arrows. 
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workflow to select the three best primer pair candidates 
based on stringent parameters (Table 2) and well-defined 
criteria which allow primer pair ranking (Figures 1 and 3). 

The first important criterion (Figure 3) is primer specifi- 
city, which is assessed using BLASTN against the entire 



genomic DNA and cDNA from all predicted transcripts. To 
ensure primer specificity in BLAST, we cancelled the default 
filter, as this allows the detection of spurious alignments 
even outside biologically relevant genomic regions. This 
is also the reason why we implemented a low stringency 



Table 2. Comparison between default and relaxed primer design parameters 



qPCR primer quality criteria 


Default parameters 


Relaxed parameters 


Primer length 


19-25 bp 


19-25 bp 


Amplicon length 


80-200 bp 


60-300 bp 
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57-60°C 
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At least 7 bp at the 5'-end and 




3 bp at the 3'-end 


3 bp at the 3'-end 



The melting temperature (7~ m ) is calculated in the PerlPrimer program (13) which uses J. SantaLucia's extensive 
nearest-neighbor thermodynamic parameters (38, 39) and the default salt conditions (1.5 mM Mg 2+ , 200 mM oligos, 
0.2 mM dNTPs and 50 mM monovalent cations). Bolded text highlights the differences between the default and relaxed 
primer design parameters. G = guanine;C = cytosine. 
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Figure 3. Workflow to select the best primer pairs. The selection of the best primer pairs is automated according to these 
hierarchical criteria. First, each primer pair is blasted and potentially discarded as described in the Figure and in the main text. 
Then, with the remaining primer pairs, pairs are discarded if at least one primer spans the 5'- or 3'-UTR. After these two stringent 
filtering steps, the remaining primers are ranked according to (i) highest transcript coverage, (ii) whether the primers are located 
within the same exon (not desirable) or not (desirable) and (iii) smallest amplicon size which has shown to be more optimal for 
qPCR efficiency and experimental variation (35). 
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£-value of 100, which assures that all possible alignments 
with primer sequences are detected. If both primers match 
with at least 60% sequence identity, a DNA or cDNA region 
spanning at most 5000 bp and the region does not corres- 
pond to the position of the targeted transcript itself, then 
the primer pair is discarded because it may wrongly target 
another gene or a pseudogene. The primer pairs that 
are similar (at least 50% overlap for both primers) to the 
one just discarded, usually shifted by 1 to 2 bases, are also 
discarded to avoid running too many BLAST searches. 
The BLAST criterion is especially important to specifically 
monitor the expression of protein-coding genes from the 
same family such as, for example, homeodomain TFs. The 
second criterion is to discard primer pairs that span the 
5'- or 3'-untranslated regions (UTRs). Transcript quantifica- 
tion based on primers targeting these regions can be biased 
as 3'-UTRs contain multiple polyadenylated regions and 
5'-UTRs are frequently absent or truncated if cDNAs are 



synthesized with an oligo(dT) primer. Other ranking criteria 
are the number of gene-specific transcripts that are covered 
(as reflected by the value of N shown in Figure 2), the size 
of the amplicon and whether the primer pair falls within 
the same exon. 

If, at the end of the ranking pipeline, no primer pair has 
passed the selection, then certain parameters are relaxed 
until a satisfactory primer pair is obtained. Parameter 
relaxation is performed within the modified PerlPrimer 
script and within the best primer pair selection workflow 
(Figure 4). In the modified PerlPrimer script, the first par- 
ameters that are relaxed are the amplicon length and the 
melting temperature difference (Table 2), followed by the 
exon/exon junction criteria (allowing <7bp at the 5'-end 
and/or <3bp at the 3'-end) and the requirement to span 
different exons. In the best primer pair selection workflow, 
parameters that can be relaxed are the extent of primer 
specificity and the location of primers in the UTR regions, 



Input 




"Nonspecific", "In UTR" and 
"DesignNoJunction" warnings 



Figure 4. Workflow to find at least one suitable primer pair by relaxing the primer design parameters. The circles schematize the 
run of the modified PerlPrimer script and the workflow of the best primer selection indicated in Figure 1. The relaxation of the 
parameters within the modified PerlPrimer script and the allowed options in the selection of the best primer pairs are depicted 
in the center and on the right of the circles, respectively. Blue and green circles represent the default parameters and the relaxed 
design parameters, respectively (Table 2). The arrows symbolize the logical flow. If no primers are found with either set of 
parameters, the program reports 'No primers'. 
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which sometimes constitute the only sites which enable the 
distinction of two or more transcripts. Primers that are gen- 
erated under these relaxed conditions are tagged, and the 
user is informed on which parameters were changed to 
obtain primers. Genes for which no primers could be 
found, even after parameter relaxation, are labeled with 
the statement 'no primers'. 

The three top-ranking primer pairs are saved in the 
output file along with a list of transcripts covered by the 
primer pair, the sequences of forward and reverse primers, 
the primer melting temperatures, the start and end primer 
positions, the Ensembl status of the gene ('known' or 
'novel'), indication of whether parameters were relaxed 
to obtain the primer pair, and a link to the graphical 
view. Finally, potential transcripts not covered by the top 
ranking primer pair are reanalyzed to design additional 
primer pairs covering the transcripts from the remaining 
set. This process is repeated until all predicted transcripts 
are covered by at least one primer pair. The results of this 
computational exercise for all annotated genes of the se- 
lected genomes are stored in a General Feature Format 
(GFF) file and saved in a MySQL database. 

Furthermore, our pipeline has been extended for 
transcript-specific expression profiling by changing the 
workflow to select the best primer pairs. Instead of ranking 
primer pairs according to the highest transcript coverage, 
the ranking is done according to the smallest transcript 
coverage. The average number of genes in Ensembl release 
61 covered by the best ranked gene- or transcript-specific 
primer pairs for each model organism is listed in Table 3. 
For example, 98% of the known Mouse Ensembl genes are 
covered by a gene- and transcript-specific primer pair. Of 
them, 50% have no associated warning and 2% have been 
obtained by relaxing primer parameters [amplicon length 
and deviation in the melting temperature (Table 2)] 



without any other type of warning. The overall computing 
time depended on the number of genes processed and 
varied per model organism, taking between two days 
(D. melanogaster) and two weeks (H. sapiens) on our 
server [Linux system (kernel 2.6.18) with 48 x Intel Xeon 
2.67 GHz quad CPU with 74GB RAM memory]. 

Database access 

Google Web Toolkit was used to generate the web inter- 
face and to display the MySQL query results directly in the 
browser. The interface to retrieve primer pairs accepts gene 
symbols, also defined as 'associated gene names' by the 
BioMart portal (21), and Ensembl gene or transcript IDs. A 
choice is available to either find primers covering most of 
the transcripts of a gene (for maximum coverage with a 
minimal number of specific primers) or, if possible, to 
have individual primer pairs specific for each single tran- 
script for detailed quantification of splice variants. The 
interface contains different filtering possibilities to obtain 
the desired primer output. For example, one can select only 
primers without warnings, or decide to include primers 
with specific warnings that are tolerated by the individual 
user (e.g. 'inUTR'). A typical query example featuring visual 
cues at each processing step has been added to the inter- 
face to assist users. The final output of the GETPrime inter- 
face is an Excel file containing primer sequences and 
parameter properties, as well as hyperlinks directing 
the user to an in-house browser, based on J Browse (23), 
showing the alternative transcripts and primer positions 
(Figure 5). The current GETPrime database is based on the 
Ensembl release 61. A previous version of GETPrime based 
on Ensembl release 50 is still accessible to the user on the 
same web interface. The only differences between the two 
versions are that, in the current version, novel genes are 



Table 3. Average number of genes covered by the best ranked primer pair for each species 
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8997 


3119 


3151 


Mus musculus 


29445 


28840 (97.9) 
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Caenorhabditis elegans 


38 237 


21 964 (57.4) 


17510 
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833 
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Drosophila melanogaster 


14869 


14 368 (96.6) 


8493 


301 


427 


1241 


2359 


376 


1171 


Danio rerio 


24 370 


24 203 (99.3) 


18091 


487 


499 


899 


1261 


2174 


792 



We calculated the average number of gene- or transcript-specific primer pairs with their principal stringencies. 
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Figure 5. J Browse-based graphical view of GETPrime primer pairs targeting the Rbbp9 mouse gene. The blue boxes on the left 
are the available tracks that can be dragged in the JBrowse genome view (23). In this example, the transcripts, the gene-specific 
primers (covering the majority of splice variants if possible) and the transcript-specific primers (covering a single splice variant, 
when possible) have been dragged into the browser. The upper part of the figure shows tools to zoom, to move to up- or 
downstream of the genome location, and to enter another chromosome, another position on the chromosome or also an 
Ensembl ID. Each primer is annotated by its Ensembl ID, its iteration in GETPrime (e.g. -1), its ranking (e.g. _3) and its primer 
type (forward and reverse primers are abbreviated Fwd and Rv, respectively). The blue box for each primer represents the 
respective alignment to the transcripts and sometimes a thin line between two blue boxes is used to bridge an intron region 
for primers spanning two exons. The primer pairs in the gene-specific track cover both transcripts. The primer pairs from the first 
iteration ('-1') and the second iteration ('-2') in the transcript-specific track are specific to the largest transcript Rbbp9-001 and 
the shortest transcript Rbbp9-002, respectively. 



included and that hyperlinks for each primer pair are sup- 
ported by distinct browser types. 

Database validation and application 

The sensitivity and specificity of the primers generated 
by GETPrime were validated experimentally. First, as a con- 
trol experiment, we selected three commonly used, and 
thus, already published primer pairs, each targeting one 
of the following control genes: Hprtl, Igfbp4, Tubb2c (see 
Supplementary Table S1 for sequences). Only primer pairs 
covering exactly the same transcripts as the one generated 
by GETPrime were selected. Similar to the well-established 
primer pairs, all three GETPrime primer pairs were of high 
quality showing high specificity based on melting curve and 
gel separation analyses as well as good amplification effi- 
ciency on cDNA synthesized from a qPCR Mouse Reference 
total RNA (a mean efficiency of 103% with an /? 2 -value of 
0.997, see 'Materials and Methods' section). In addition, the 
amount of PCR product generated by each primer pair in 
pre-adipocyte 3T3-L1 cells pre- and post-differentiation 
induction ('Materials and Methods' section, DO, D2 and 
D4) was comparable (AC q <1). 



Next, we evaluated the quality of GETPrime primers by 
targeting 60 TFs in cDNA samples synthesized from the 
same mouse reference total RNA ('Materials and 
Methods' section). We chose TFs as they are typically ex- 
pressed at lower levels than non-TF-coding genes and are 
therefore in general more difficult to detect (24, 25), thus 
rendering this validation assay more stringent than when a 
number of genes is targeted coding for a diverse set of 
proteins. In total, 45 out of the 60 tested primer pairs 
were of high quality as evidenced by their high amplifica- 
tion efficiency (a mean efficiency of 98.94% with an R 2 - 
value of 0.994) and by their high specificity based on 
their corresponding dissociation curves. Of the remaining 
15 primer pairs, one was found to form primer dimers indi- 
cated by the presence of a clear melting curve peak in 
the no template and the no reverse transcriptase negative 
controls, and 14 only yielded a low signal likely due to the 
fact that the corresponding target genes were only lowly 
abundant both in the Reference RNA as well as the 3T3-L1 
RNA samples (26) ('Materials and Methods' section). 
However, we were able to generate a standard curve for 
five out of the 14 primer pairs because the corresponding 
TF open reading frame (ORF) clones are available in our 
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laboratory. These primer pairs yielded an amplification ef- 
ficiency mean of 98.4% with a mean /? 2 -value of 0.995. 
Thus, 50 out of 51 primer pairs that we were able to test, 
satisfy the qPCR reliability criteria, demonstrating that the 
vast majority of primers in the GETPrime database are of 
high quality. 

To further evaluate primer specificity, we designed a 
stringent assay with GETPrime primers which target one 
TF ORF within a library of TF ORF clones producing hom- 
ologous proteins, here, either homeodomain- or ZF-C2H2 
TFs ('Materials and Methods' section and Supplementary 
Table S2). For each of the selected TF targets, we obtained 
a clear qPCR amplification signal in contrast to when the 
same TF ORF library was used as qPCR template but without 
containing the respective target TF ORF (Supplementary 
Figure S1), indicating that the respective primer pair is 
highly target-specific. Finally, to evaluate the ability of 
GETPrime to differentiate between gene-specific tran- 
scripts, we used 3T3-L1 cells pre- and post-differentiation 
induction and chose one relatively straight-forward splicing 
scenario featuring one gene, Ubtf, that, according to 
Ensembl release 50, has two distinct transcript forms 
Ubtf_a and Ubtf_b, each representing respectively five 
and two different splice forms (Figure 6A). GETPrime pri- 
mers were able to differentiate both Ubtf splice forms at 
the two selected differentiation time points and did so in 
quantitative fashion in that the sum of the individual tran- 
script amounts matched the overall gene expression 
amount (Figure 6B). 

Discussion 

The experimental results demonstrate the power of 
GETPrime to produce gene- or transcript-specific qPCR pri- 
mers. The results also show that the generated primers are 
of high quality and that these primers are able to detect 
low-abundant transcripts such as those coding for TFs. 
Moreover, they demonstrate their capacity to specifically 
recognize targets within a pool of templates coding for 
highly homologous proteins, as well as their high amplifi- 
cation efficiency. Thus, given the fact that there are, to our 
knowledge, no other software and web tools that offer the 
same set of attributes as the GETPrime platform (Table 1), 
we believe that GETPrime constitutes an important advance 
of the qPCR primer design field. One other recently de- 
veloped qPCR primer design software QuantPrime (14) 
also features gene-specific primer design (i.e. covering the 
majority of splice variants) as an option. However, this soft- 
ware offers no straight-forward way to identify which tran- 
scripts are covered by the gene-specific primers. The user is 
therefore obliged to blast each primer in the Ensembl data- 
base to find this information, which is time-consuming, es- 
pecially if a large set of genes need to be targeted. In 
addition, it is often impossible to generate primers that 



cover all gene transcripts. For these genes, multiple 
primer pairs need to be used to yield a gene-specific 
read-out. In contrast to QuantPrime, the GETPrime data- 
base provides all this information in easy-to-retrieve and 
graphical fashion. Thus, while it is clear that QuantPrime 
is a powerful primer design program, given its great par- 
ameter flexibility for large-scale qPCR primer design and 
user-friendly interface, it appears not to have been de- 
signed with the a priori aim of yielding gene- or 
transcript-specific primers. Moreover, we found that 
QuantPrime processing time significantly increases with 
increasing gene length and number. GETPrime does not 
suffer from this drawback as its database interface allows 
direct primer retrieval. This is similar to other Primer data- 
bases, like qPrimerDepot (19), PrimerBank (27) and 
RTPrimerDB (17), which also provide pre-computed qPCR 
primers. The limitations of these databases, though, are 
that they do not accommodate primer retrieval in batches 
and that they list multiple primer pairs per gene. It is then 
up to the individual user to evaluate each primer pair for 
experimental suitability, which is cumbersome, in contrast 
to GETPrime, which ranks all primers according to 
well-defined criteria. An important GETPrime drawback is 
that it is so far available only for five commonly used model 
organisms including humans. However, demands to design 
primers for other organisms of interest can be accommo- 
dated or can even be performed by the end user via adjust- 
ment of the GETPrime Perl script, which is available upon 
request. Thus, the GETPrime database currently includes 
primers for H. sapiens, M. musculus, C elegans, D. melano- 
gaster and D. rerio genes in assembled chromosomes anno- 
tated in the Ensembl database release 50 and 61. We 
thereby plan to update GETPrime as soon as major new 
Ensembl releases are available. 

Materials and Methods 

Cell culture 

3T3-L1 mouse fibroblast cells (28) were cultured in DMEM 
supplemented with 10% fetal bovine serum, with L-glutam- 
ine 2mM and penicillin/streptomycin (1x) in a 5% C0 2 
humidified atmosphere at 37°C and maintained <80% con- 
fluence before passaging. Differentiation of 3T3-L1 cells 
was induced by exposing two-day post-confluent cells 
[designated as Day 0 (DO)] to DMEM containing 10% FCS 
(Bioconcept, Allschwil, Switzerland) supplemented with 
1 \M dexamethasone, 0.5 mM 3-isobutyl-1-methylxanthine 
and 1 jig/ml insulin (Sigma, St Louis, USA). At D2, cells were 
fed with DMEM containing FCS and 1 jig/ml insulin and two 
days later (D4), the media was changed to 10% FBS/DMEM. 
Full differentiation is usually achieved by Days 6-8. The DO, 
D2 and D4 samples have been used for RNA extraction, 
cDNA preparation and qPCR. 
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Figure 6. Graphical view and qPCR results to validate C/bff-targeting primers covering either all or a subset of Ubtf transcripts. 
(A) The 'Ubtf primer pair in blue covers all seven transcripts (gene-specific primers) and the red 'Ubtf_a' and 'Ubtf_b' primer 
pairs cover five and two transcripts, respectively (transcript-specific primers). In this example, GETPrime could not find primers 
differentiating each transcript. (B) The relative gene expression levels before differentiation (DO) and four days after (D4) were 
normalized to Hprtl and Tubb2c expression levels. 'Ubtf represents the primer pair covering all seven transcripts, whereas, 
'Ubtf_a' and 'Ubtf_b' are primer pairs specific to a subset of five and two transcripts, respectively. 'Ubtf_a+Ubtf_b' represents the 
sum of relative gene expression of 'Ubtf_a' and 'Ubtf_b'. The data indicate that GETPrime can effectively differentiate distinct 
transcripts, as the sum of the individual transcript amounts matched the overall gene expression amount. 



RNA extraction and cDNA synthesis 

We have used two different experimental samples. First, to 
analyze the quality of the primers, we used Agilent's qPCR 



Mouse Reference total RNA (Agilent technologies, Santa 
Clara, USA). For the other reported experiments, we used 
total cellular RNA isolated from 3T3-L1 cells using the 
RNeasy Plus Mini Kit (Qiagen) according to the 
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manufacturers' instructions without DNAse treatment. 
After extraction, RNA was quantified using a NanoDrop 
Spectrophotometer 1000 v3.2.0 (NanoDrop Technologies, 
Detroit, USA). The RNA quality was further determined 
using a nanodrop (1.8< A260/A280 < 2.2) and by visual in- 
spection of separated bands on agarose gels. A quantity 
of 1 \xg of RNA was used for the reverse transcription 
performed with random primers (Invitrogen) and 
Superscript III First Strand synthesis Supermix (Invitrogen, 
Carlsbad, USA) in a total volume of 20jil according to sup- 
plier's recommendations. The cDNA samples were stored 
at -20°C. 

qPCR 

qPCR was performed in 96- (manual) and 384-well plates 
(robotic) with three technical replicates on the ABI-7900HT 
Real-Time PCR System (Applied Biosystems, Foster City, 
USA) using Power SYBR Green Master Mix (Applied 
Biosystems, Foster City, USA) using standard procedures. 
Briefly, the standard protocol from Applied Biosystem 
was used (50°C for 2min and 95°C for 10min; then 
40 cycles of 95°C for 15s and 60°C for 1 min) and finalized 
by a dissociation step (95°C for 1 5 s, 60°C for 1 5 s and a ramp 
rate of 2% to 95°C). The amount of DNA, primer and Power 
SYBR Green Master Mix are indicated in the Supplementary 
Table S1. 

qPCR amplification efficiencies were calculated using the 
qPCR instrument software and were based on the linear 
regression of five serially diluted samples (a 4-fold dilution 
series). The slope of the standard curve gives the amplifica- 
tion efficiency by the formula E= io ( - 1/slope) . If the amplifi- 
cation is 100% efficient (percentage expressed by: £-1), 
then the amount of PCR product should be doubled per 
cycle, resulting in an £-value of 2. Primers were considered 
reliable if they featured efficiency values between 92% and 
108 % with a correlation coefficient, R 2 (i.e. how well the 
standard curve regression line fits the data), >0.99. Primers 
were specific in targeting the gene or transcript of interest 
if qPCR melting curve analysis yielded a single sharp dissoci- 
ation peak. In rare cases, a specific amplification reaction 
showed a so-called 'shoulder peak' (29) which occurs within 
amplicons containing multiple melting domains with vary- 
ing Guanine-Cytosine (GC) contents. When additional peaks 
(off-target or primer-dimers) or shoulder peaks were found 
in the melting curve, the specificity of qPCR products was 
also assessed by gel separation. A no template (to detect 
primer dimer formation) and no reverse transcriptase (to 
exclude DNA amplification) negative controls were also 
included in the presented qPCR assays. 

In the standard curve analysis, genes having C q -values 
in at least three dilution series >33 were considered as 
lowly expressed and were excluded from the analysis of 
calculating the average amplification efficiency. When an 
ORF clone was available for one of these genes, the primer 



quality assessment was done on five aliquots of a 4-fold 
dilution series of the clone of interest (starting amount 
~1 pg). 

Expression in 3T3-L1 was quantified using the AAC t - 
method and the data were normalized to Hprtl and 
Tubb2c expression. The expression of both these genes re- 
mains the most stable during 3T3-L1 cell differentiation 
within a set of six tested candidate reference genes (Actb, 
Hprtl , Igfbp4, Knabl, Tubb2c, GusB), as found by the nor- 
malization in geNorm software (30). 

In our experiments, primers were retrieved from the 
GETPrime database based on Ensembl release 50 with the 
most stringent parameters if available, or with the slightly 
less stringent design method 'DesignRelaxed' (Figure 4 and 
sequences in Supplementary Table S1). Analysis of the 
standard curve, dissociation curve and results were done 
directly by using the software SDS 2.4 from Applied 
Biosystems. To allow qPCR data exchange, RDML files (31) 
were generated by using qbase PLUS software [http://www 
.biogazelle.com, (32)] and are available in the 
Supplementary Data. 

Testing of primer specificity within a family 

To validate primer specificity, we first generated two 
libraries containing 80 and 55 ORF clones encoding TFs be- 
longing to, respectively, the homeodomain and ZF-C2H2 
protein families (Supplementary Table S2). TFs from each 
family were selected based on their phylogenetic rela- 
tedness as reported in Refs (33) and (34), thus to make 
primer selection as difficult as possible. Next, we evaluated 
whether we could specifically target, respectively, seven 
(D/x4, HoxdlO, HoxcW, Pitx2, Barxl, \rx6, Hoxb6) and six 
randomly selected TF ORFs (Egr2, Zfp148, Zfp354c, Zfp451, 
Zfp688, Zscan20) within the latter libraries using GETPrime 
primers. To do this, we generated libraries with and with- 
out the selected target TF ORFs. 

Supplementary Data 

Supplementary data are available at Database online. 
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